# Low-Memory Inference
Internvl3 38B FP8 Dynamic
MIT
This is the FP8 static quantization version of OpenGVLab/InternVL3-38B, optimized for high-performance inference using vLLM. It achieves approximately 2x acceleration on vision-language tasks with minimal accuracy loss.
Text-to-Image
Safetensors Supports Multiple Languages
I
ConfidentialMind
5,173
1
Nllb 200 Distilled 1.3B Ct2 Int8
NLLB-200 Distilled 1.3B is a neural machine translation model developed by Meta, supporting translation between 200 languages, utilizing CTranslate2 for efficient inference.
Machine Translation
Transformers Supports Multiple Languages

N
OpenNMT
101
10
Featured Recommended AI Models